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© Her3: A novel EGF receptor homolog. 



© The cloning, sequence and expression of a novel member of the EGFR/new subfamily of receptor 
tyrosine kinases, termed HER3, is described. The HER3 precursor comprises 1342 amino acid residues, 
1323 residues of which comprise the mature HER3. Recombinant HER3 does not interact with the 
homologous ligands EGF and TGF-a. A 160 kD phosphoprotein is present in cells expressing both 
HER3 and membrane-associated Amphiregulin, whereas the 170 kD EGF receptor is completely absent. 
Immunoprecipitation experiments suggest this 180 kD phosphoprotein is HER3 and that it is present 
only when coexpressed with the integral membrane form of the AR precursor. 
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HER3: A NOVEL EGF RECEPTOR HOMOLOG 



1. INTRODUCTION 

The present invention is directed to HER3, a novel receptor tyrosine kinase which is related to the Epidermal 
5 Growth Factor Receptor (EGFR) and HER2/neu , to nucleotide sequences encoding HER3, and to methods for 
the generation of DNAs encoding HER3 and their gene products. The HER3 DNAs and polypeptides of the 
invention may be useful in the diagnosis and treatment of certain abnormalities involving HER3 and/or EGFR 
and HER2/r?eu, such as certain neoplastic diseases. 

10 2. BACKGROUND OF THE INVENTION 

Cell growth and differentiation are regulated in part by the specific interaction of secreted growth factors 
and their membrane-bound receptors. Receptor-ligand interaction results in activation of intercellular signals 
leading to specific cellular responses. EGF, platelet-derived growth factor (PDGF), insulin, insulin-like growth 

15 factor-1, colony stimulating factor-1, and fibroblast growth factor all transmit their growth modulating signals 
by binding to and activating receptors with intrinsic tyrosine kinase activity (reviewed in Hanks et aL, 1 988, Sci- 
ence 241 : 42-52). Characterization of the physiologic and chemical effects that result from the binding of EGF 
and TGF-a to the EGF receptor has served as a useful model for understanding receptor-ligand interactions, 
signal transduction, and the regulation of cell growth and oncogenesis (reviewed in Yarden and Ullrich, 1988, 

20 Biochemistry 27: 31 1 3-19) 

The EGF receptor (EGFR) is a 170 kD transmembrane protein possessing intrinsic tyrosine kinase activity 
(reviewed in Hunter and Cooper, 1985, Ann. Rev. Biochem. 54: 897-930). HER2//?ei/ is a 185 KD transmem- 
brane receptor protein and the product of the neu proto-oncogene; HER2//7eu is structurally similar to EGFR 
and, like EGFR, HER2Jneu also possesses tyrosine kinase activity (Stern et aL, 1986, Mol. Cell. Biol. 8: 1729- 

25 40; Akiyama et aL, 1986, Science 232: 1644-46). The ligand for HER2/neu is not known. Reports from several 
groups indicate that both EGFR and HER2//?eu are involved with the development and/or maintenance of 
human cancers. In this regard, overexpression of EGFR in squamous cell carcinomas and glioblastomas, and 
overexpression of HER2/neu in breast and ovarian carcinomas, have been reported. 

30 2.1. AMPHIREGULIN 

Amphiregulin (AR) was originally identified from phorbol ester treated human breast carcinoma cells, based 
on its ability to inhibit the growth of several carcinoma cell lines while stimulating proliferation of normal cells 
The cloning and expression of Amphiregulin is described in co-pending GB-A-2 214 185, which application is 
35 incorporated by reference herein in its entirety. AR shares 38% sequence identity with Epidermal Growth Factor 
(EGF), and conserves most of the residues proposed to be involved in binding to the EGF receptor (EGFR). 
However, mature AR contains a distinct amino-terminal sequence of 43 predominantly hydrophilic amino acids, 
not present in other EGF-like proteins. 

40 3. SUMMARY OF THE INVENTION 

The present invention is directed to HER3, a novel receptor tyrosine kinase related to the EGF receptor 
and HER2/neu, nucleic acid molecules and vectors encoding HER3, methods for the recombinant synthesis 
of HER3, anti-HER3 antibodies, and diagnostic and therapeutic uses thereof. 
45 In specific embodiments, described by way of the examples in Sections 6 and 7, infra, the cloning and exp- 

ression of HER3/c-erbB-3 is described. The 6 kb HER3/c-erbB-3 transcript has been identified in various human 
tissues. HER3, while structurally similar to EGFR and HER2//7ew, does not bind EGF or TGF-a. However, HER3 
is specifically phosphorylated when co-expressed with the Amphiregulin precursor. 

50 4. BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 Nucleotide sequence and deduced amino acid sequence of HER3 (1342 residues). Nucleotides are 
numbered on the left, and amino acids are numbered above the sequence. The 1 9 residue signal sequence is 
underscored with a single line, and the predicted amino-terminus of the mature protein is indicated by +1. The 
55 transmembrane domain is marked with a double underline at amino acids 614 - 645. Potential N-linked gly- 
cosylation sites (NXS, or NXT) are denoted with -CHO-. No poly(A) tail was present in the HER3 clones. This 
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sequence represents a consensus based on cDNA and PCR clones isolated from MDA-MB-361 RNA, and was 
used in subsequent expression constructs. Clonal sequence differences are listed below the consensus resi- 
due, and include: A 441 to G in one of 4 PCR clones; G 1877 to A from A431 cDNA clone; A 3619 to G in one cDNA 
clone; A 3642 to G in one cDNA clone. 

5 FIG. 2 Hydropathy profile of HER3 and comparison of protein domains for HER3 (1342 amino acids), EGFR 

(1210 amino acids), and neu/c-erbB-2 (1255 amino acids). The signal peptide and transmembrane domains 
are represented by filled boxes, the cysteine-rich extracellular subdomains are hatched, and the cytoplasmic 
tyrosine kinase domain is stippled. The percent amino acid sequence identity between HER3 and representa- 
tives of other classes of tyrosine kinases are indicated. Sig, signal peptide; I, II, IN, and IV, extracellular domains; 

10 TM, transmembrane domain; 3 r UTR t 3' untranslated region; HIR, insulin receptor; PDGFR, platelet-derived 
growth factor receptor; ND, not determined. 

FIG. 3 Protein sequence comparison between members of the human EGF receptor family. Sequences 
are displayed using the single-letter code, with identical residues denoted with dots. Gaps are introduced for 
optimal alignment and are show by a dash. The signal sequences are bounded by single lines, the transmem- 

15 brane domains are bounded by double lines, and sequences used to derive the probes and primers (ARRD1, 
ARRD2) used for cloning are underlined. Cysteine residues are marked with asterisks, the potential ATP-bind- 
ing site is shown with circled crosses (G-X-G-X-X-G beginning at residue 697, and K at 723), carboxyl-terminal 
tyrosines are denoted with open triangles, and additional residues of HER3 that are referred to in Section 6., 
infra are marked with arrows (A 657 , C™, H 740 and 

20 FIG. 4 Differential effects of AR on EGFR and HER3. (A) Immunoblot analysis of the recombinant HER3 

transiently expressed in COS cells. The complete HER3 coding sequence was reconstructed in a cDM8 exp- 
ression vector as described in Section 7.1.1, infra. Three days after transfection, cells werelysed in SDS-sam- 
ple buffer, protein (100 jig per lane) was resolved on 7% SDS gels, and immunoblotted using HER3-specific 
antisera. Lane 1, HER3 transfected COS cells; lane 2, cDM8 mock transfected COS cells. Note the presence 

25 of a 1 60 kD band. (B) Amphiregulin blocks phosphorylation of EGFR and stimulates phosphorylation of HER3. 
Immune complex kinase assays were performed on COS cells transfected with cDM8 vector alone (lanes 1-3), 
pH3ARP (lane 4), cHER3x (lanes 5-7), or cHER3x and pH3ARP (lane 8), and treated with EGF (lanes 2, 6), 
AR (lanes 3, 7), or buffer control (lanes 1, 4, 5, 8) as described in Section 7.1.2. infra. (C) Specific immunop- 
recipitation of phosphorylated HER3 from in vivo labeled COS cells transfected with cHER3x and pH3ARP (lane 

30 1) compared to mock transfected controls (lane 2). 

5. DETAILED DESCRIPTION OF THE INVENTION 

The present Invention is directed to what applicants have termed "HER3", a closely related yet distinct 
35 member of the Human EGF Receptor (HER)/net/ subfamily of receptor tyrosine kinases, HER3-encoding nuc- 
leic acid molecules (e.g., cDNAs, genomic DNAs, RNAs, anti-sense RNAs, etc.), production of mature and pre- 
cursor HER3 from a HER3 gene sequence and its product, recombinant HER3 expression vectors, HER3 
analogues and derivatives, HER3 antibodies, and diagnostic and/or therapeutic uses of HER3 and related pro- 
teins, HER3-encoding nucleic acid molecules, anti-HER3 antibodies, HER3 ligands, and HER3 antagonists. 

40 

5. 1. PRODUCTION OF RECOMBINANT HER3 

Mature biologically active HER3 may be produced by the cloning and expression of the HER3 precursor 
coding sequence or its functional equivalent in a host cell which processes the precursor correctly. Functional 

45 equivalents of the HER3 precursor coding sequence include any DNA sequence which, when expressed inside 
the appropriate host cell, is capable of directing the synthesis, processing and/or export of mature HER3. 

Production of an HER3 using recombinant DNA technology may be divided into afour-step process for the 
purposes of description: (1) isolation or generation of the coding sequence (gene) for a precursor or mature 
form of the desired HER3; (2) construction of an expression vector capable of directing the synthesis of the 

50 desired HER3; (3) transfection or transformation of appropriate host cells capable of replicating and expressing 
the HER3 gene and/or processing the gene product to produce the desired HER3; and (4) identification and 
purification of the desired HER3 product. 

5.1.1. ISOLATION OR GENERATION OF THE HER3 GENE 

55 

The nucleotide coding sequences of HER3, or functional equivalents thereof, may be used to construct 
recombinant expression vectors which will direct the expression of the desired HER3 product. The nucleotide 
coding sequence for HER3 is depicted in FIG. 1. In the practice of the method of the invention, the nucleotide 
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sequence depicted therein, or fragments or functional equivalents thereof, may be used to generate the recom- 
binant molecules which will direct the expression of the recombinant HER3 product in appropriate host cells. 
HER3-encoding nucleotide sequences may be obtained from a variety of cell sources which produce HER3-iike 
activities and/or which express HER3-encoding mRNA. Applicants have identified a number of suitable human 

5 cell sources for HER3 including epidermoid and breast carcinoma cells, normal colon, kidney, and brain. 

The HER3 coding sequence may be obtained by cDNA cloning from RNA isolated and purified from such 
cell sources or by genomic cloning. Either cDNA or genomic libraries of clones may be prepared using techni- 
ques well known in the art and may be screened for particular HER3-DNAs with nucleotide probes which are 
substantially complementary to any portion of the HER3 gene. Full length clones, i.e., those containing the 

10 entire coding region of the desired HER3 may be selected for constructing expression vectors. Alternatively, 
HER3-encoding DNAs may be synthesized in whole or in part by chemical synthesis using techniques standard 
in the art. 

Due to the inherent degeneracy of nucleotide coding sequences, other DNA sequences which encode sub- 
stantially the same or a functionally equivalent amino acid sequence may be used in the practice of the method 

15 of the invention. Such alterations of HER3 nucleotide sequences include deletions, additions or substitutions 
of different nucleotides resulting in a sequence that encodes the same or a functionally equivalent gene product. 
The gene product may contain deletions, additions or substitutions of amino acid residues within the sequence 
which result in silent changes thus producing a bioactive product. Such amino acid substitutions may be made 
on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic nat- 

20 ure of the resides involved. For example, negatively charged amino acids include aspartic acid and glutamic 
acid; positively charged amino acids include lysine and arginine; amino acids with uncharged polar head groups 
or nonpolar head groups having similar hydrophilicity values include the following: leucine, isoleucine, valine; 
glycine, alanine; asparagine, glutamine; serine, threonine; phenylalanine, tyrosine. 

25 5. 1 .2. CONSTRUCTION OF HER3 EXPRESSION VECTORS 

In order to express biologically active, precursor or mature forms of HER3, an expression vector/host sys- 
tem should be chosen which provides not only for high levels of transcription and translation but for the correct 
processing and post-translational modification of the gene product. This may be especially important when 

30 employing the entire coding sequence of the HER3 precursor in the expression constructs since the mature 
form of HER3 appears to be derived from a larger precursor via cellular processing events. 

A variety of animal/host expression vector systems (i.e., vectors which contain the necessary elements for 
directing the replication, transcription and translation of the HER3 coding sequence in an appropriate host cell) 
may be utilized equally well by the skilled artisan. These include, but are not limited to, virus expression vec- 

35 tor/mammalian host cell systems (e.g., cytomegalovirus, vaccinia virus, adenovirus, and the like); insect virus 
expression vector/insect cell systems (e.g., bacuiovirus); or nonviral promoter expression systems derived from 
the genomes of mammalian cells (e.g., the mouse metallothionein promoter). 

The expression elements of these vectors vary in their strength and specificities. Depending on the 
host/vector system utilized, anyone of a number of suitable transcription and translation elements maybe used. 

40 For instance, when cloning in mammalian cell systems, promoters isolated from the genome of mammalian 
cells, (e.g. mouse metallothionein promoter) or from viruses that grow in these cells, (e.g. vaccinia virus 7.5K 
promoter or Moloney murine sarcoma virus long terminal repeat) may be used. Promoters produced by recom- 
binant DNA or synthetic techniques may also be used to provide for transcription of the inserted sequences. 
Specific initiation signals are also required for sufficient translation of inserted protein coding sequences. 

45 These signals include the ATG initiation codon and adjacent sequences. In cases where an entire HER3 gene 
including its own initiation codon and adjacent sequences are inserted into the appropriate expression vectors, 
no additional translational control signals may be needed. However, in cases where only a portion of the coding 
sequence is inserted, exogenous translational control signals, including the ATG initiation codon must be pro- 
vided. Furthermore, the initiation codon must be in phase with the reading frame of the HER3 coding sequences 

so to ensure translation of the entire insert These exogenous translational control signals and initiation codons 
can be of a variety of origins, both natural and synthetic. The efficiency of expression may be enhanced by the 
inclusion of transcription attenuation sequences, enhancer elements, etc. 

Any of the methods previously described for the insertion of DNA fragments into a vector may be used to 
construct expression vectors containing the HER3 gene and appropriate transcriptional/translational control 

55 signals. These methods may include in vitro recombinant DNA techniques, synthetic techniques and in vivo 
recombinations. 

For example, in cases where an adenovirus is used as an expression vector, the HER3 coding sequence 
may be ligated to an adenoviris transcription/translation control complex, e.g., the late promoter and tripartite 
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leader sequence. This chimeric gene may then be inserted in the adenovirus genome by in vitro or in vivo recom- 
bination. Insertion may be into a non-essential region of the viral genome that is viable and capable of expres- 
sing the HER3 in infected hosts. Similarly, the vaccinia 7.5K promoter may be used. 

An alternative expression system which could be used to express HER3 is an insect system. In one such 

5 system, Autographa californica nuclear polyhidrosis virus (AcNPV) is used as a vector to express foreign 
genes. The virus grows in Spodoptera frugiperda cells. The HER3 coding sequence may be cloned into non- 
essential regions (for example the polyhedrin gene) of the virus and placed under control of an AcNPV promoter 
(for example the polyhedrin promoter). Successful insertion of the HER3 coding sequence will result in inacti- 
vation of the polyhedrin gene and production of non-occluded recombinant virus (i.e., virus lacking the pro- 

10 teinaceous coat codes for by the polyhedrin gene). These recombinant viruses are then used to infect 
Spodoptera frugiperda cells in which the inserted gene is expressed. 

Retroviral vectors prepared in amphotropic packaging cell lines permit high efficiency expression in numer- 
ous cell types. This method allows one to assess cell-type specific processing, regulation or function of the 
inserted protein coding sequence. 

15 In addition, a host ceil strain may be chosen which modulates the expression of the inserted sequences, 

or modifies and processes the gene product in the fashion desired. Expression from certain promoters can be 
elevated in the presence of certain inducers, (e.g. zinc and cadmium ions for metallothionein promoters). There- 
fore, expression of HER3 may be controlled. This is important if the protein product of the cloned foreign gene 
is lethal to host cells. Furthermore, modifications (e.g. glycosylation) and processing (e.g., cleavage) of protein 

20 products are important for the function of the protein. Different host cells have characteristic and specific 
mechanisms for the post-translational processing and modification of proteins. Appropriate cell lines or host 
systems can be chosen to ensure the correct modification and processing of the foreign protein expressed. 

5.1.3. IDENTIFICATION OF TRANSFECTANTS OR TRANS FORMANTS EXPRESSING HER3 GENE 
25 PRODUCTS 

The host cells which contain the recombinant coding sequence and which express the biologically active, 
mature product may be identified by at least four general approaches (a) DNA-DNA, DNA-RNA or RNA-anti- 
sense RNA hybridization; (b) the presence or absence of "marker" gene functions; (c) assessing the level of 
30 transcription as measured by the expression of HER3 mRNA transcripts in the host cell; and (d) detection of 
the mature gene product as measured by immunoassay and, ultimately, by its biological activities. 

In the first approach, the presence of HER3 coding sequences inserted into expression vectors can be 
detected by DNA-DNA hybridization using probes comprising nucleotide sequences that are homologous to 
the HER3 coding sequence. 

35 In the second approach, the recombinant expression vector/host system can be identified and selected 

based upon the presence or absence of certain "marker" gene functions (e.g., thymidine kinase activity, resi- 
stance to antibiotics, resistance to methotrexate, transformation phenotype, occlusion body formation in 
baculovirus, etc.). For example, if the HER3 coding sequence is inserted within a marker gene sequence of 
the vector, recombinants containing that coding sequence can be identified by the absence of the marker gene 

40 function. Alternatively, a marker gene can be placed in tandem with the HER3 sequence under the control of 
the same or different promoter used to control the expression of the HER3 coding sequence. Expression of 
the marker in response to induction or selection indicates expression of the HER3 coding sequence. 

in the third approach, transcriptional activity for the HER3 coding region can be assessed by hybridization 
assays. For example, polyadenylated RNA can be isolated and analyzed by Northern blot using a probe 

45 homologous to the HER3 coding sequence or particular portions thereof. Alternatively, total nucleic acids of 
the host cell may be extracted and assayed for hybridization to such probes. 

In the fourth approach, the expression of HER3 can be assessed immunologically, for example by Western 
blots, immunoassays such as radioimmunoprecipitation, enzyme-linked immunoassays and the like. The ulti- 
mate test of the success of the expression system, however, involves the detection of the biologically active 

50 HER3 gene product. Where the host cell secretes the gene product the cell free media obtained from the cul- 
tured transfectant host cell may be assayed for HER3 activity. Where the gene product is not secreted, cell 
lysates may be assayed for such activity. In either case, assays which measure ligand binding to HER3 or other 
bioaetivities of HER3 may be used. 

55 5. 2. HER3 DERIVATIVES, ANALOGS AND PEPTIDES 

The production and use of derivatives, analogues, and peptides related to HER3 are also envisioned and 
are within the scope of the invention. Such derivatives, analogues, or peptides may have enhanced or dimi- 
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nished biological activities in comparison to native HER3. HER3-related derivatives, analogues, and peptides 
of the invention may be produced by a variety of means known in the art. Procedures and manipulations at the 
genetic and protein levels are within the scope of the invention. At the protein level, numerous chemical mod- 
ifications could be used to produce HER3-Hke derivatives, analogues, or peptides by techniques known in the 
5 art, including but not limited to specific chemical cleavage by endopeptidases {e.g. cyanogen bromides, trypsin, 
chymotrypsin, V8 protease, and the like) or exopeptidases, acetylation, formylation, oxidation, etc. 

5.3. ANTI-HER3 ANTIBODIES 

10 Also within the scope of the invention is the production of polyclonal and monoclonal antibodies which rec- 

ognize HER3 or related proteins. 

Various procedures known in the art may be used for the production of polyclonal antibodies to epitopes 
of HER3. For the production of antibodies, various host animals can be immunized by injection with HER3, or 
a synthetic HER3 peptide, including but not limited to rabbits, mice, rats, etc. Various adjuvants may be used 
is to increase the immunological response, depending on the host species, including but not limited to Freund's 
(complete and incomplete), mineral gels such as aluminum hydroxide, surface active substances such as 
lysolecithin, piuronic polyols, polyanions, oil emulsions, keyhole iympet hemocyanins, dinitrophenol, and poten- 
tially useful human adjuvants such as BCG (bacilie Calmette-Guerin) and Corynebacterium parvum. 

A monoclonal antibody to an epitope of HER3 can be prepared by using any technique which provides for 
20 the production of antibody molecules by continuous cell lines in culture. These include but are not limited to 
the hybridoma technique originally described by Kohler and Milstein (1 975, Nature 256, 495-497), and the more 
recent human B-cell hybridoma technique (Kosbor et aL, 1983, Immunology Today 4:72) and EBV-hybridoma 
technique (Cole et al., 1985, Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Antibody fragments which contain the idiotype of the molecule can be generated by known techniques. For 
25 example, such fragments include but are not limited to: the F(ab') 2 fragment which can be produced by pepsin 
digestion of the antibody molecule; the Fab' fragments which can be generated by reducing the disulfide bridges 
of the F(ab')2 fragment, and the two Fab fragments which can be generated by treating the antibody molecule 
with papain and a reducing agent. 

Antibodies to HER3 may find use in the qualitative detection of mature HER3 and their precursor and sub- 
30 component forms, in the affinity purification of HER3 polypeptides, and in the elucidation of HER3 biosynthesis, 
metabolism and function. Antibodies to HER3 may also be useful as diagnostic and therapeutic agents. 

5.4. USES OF HER3, HER3-ENCODING NUCLEIC ACID MOLECULES AND ANT1-HER3 ANTIBODIES 

35 Applicants envision a wide variety of uses for the compositions of the present invention, including diagnostic 

and/or therapeutic uses of HER3, HER3 analogues and derivatives, HER3-encoding nucleic acid molecules 

and anti-HER3 antibodies. 

HER3-encoding nucleic acid molecules or fragments thereof may be used as probes to detect and quantify 

mRNAs encoding HER3. Assays which utilize nucleic acid probes to detect sequences comprising ail or part 
40 of a known gene sequence are well known in the art HER3 mRNA levels may indicate emerging and/or exiting 

neoplasias as well as the onset and/or progression of other human diseases. Therefore, assays which can 

detect and quantify HER3 mRNA may provide a valuable diagnostic tool. 

Anti-sense HER3 RNA molecules may be useful therapeutically to inhibit the translation of HER3-encoding 

mRNAs where the therapeutic objective involves a desire to eliminate the presence of a HER3. HER3 anti- 
45 sense RNA, for example, could be useful as a HER3 antagonizing agent in the treatment of diseases for which 

HER3 is a causative agent. Additionally, HER3 anti-sense RNAs may be useful in elucidating HER3 functional 

mechanisms. 

HER3-encoding nucleic acid molecules may be used for the production of recombinant HER3 proteins and 
related molecules, as separately discussed in Section 5.1., supra. 

50 

5. 5. MOLECULAR BIOLOGY OF HER3 

EGFR shares structural homology with HER2/neu; 83% amino acid identity in the cytoplasmic kinase 
domain, and 42% homology in the extracellular I igand binding domain where 49 of 50 cysteines are conserved. 
55 Applicants investigated whether the 78 amino acid form of Amphireguiin (AR) interacts with this putative recep- 
tor and homolog of EGFR, since the I igand for neu has not been identified. Experiments were performed on 
cells expressing high levels of human (SKBR-3 and BT474) or rat (B1 04-1-1 and DHFR-G8) net/. Direct binding, 
in vitro kinase assays, proliferation assays, in vivo phosphorylation, and receptor down-regulation assays ail 
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showed no association between the 78 amino acid form of AR and neu (G. Plowman, M. Shoyab, Y. Yarden, 
G. Todaro, and R. Weinberg, unpublished observations). It is possible that largerforms of AR, such as the 35 
kD and transmembrane-bound forms of AR produced in cells transfected with the AR precursor gene, may 
interact with HER2//?eu and/or HER3. Applicants are currently investigating this possiblity. Forthe present, how- 

5 ever, the ligand for neu remains elusive. 

Northern blot analysis on A431 and placenta! RNA provides evidence for the existence of other EGFR-re- 
lated transcripts. A subclone of the human epidermoid carcinoma cell line A431 was isolated (A431-A3) after 
growth in soft agar. These cells have 30-fold increased sensitivity to AR inhibition compared with the parent 
line, yet had no alteration in the number or affinity of EGFR as measured by binding competition assays. 

10 PoIy(A) + RNA was prepared from A431-A3 cells and a cDNA library constructed by specific priming with a 72- 
fold degenerate oligonucleotide complimentary to a stretch of 1 0 amino acids unique to the kinase domains of 
EGFR and neu. An upstream region of 7 amino acids, also conserved in EGFR and neu, was used to design 
a 96-fold degenerate oligonucleotide probe that was used to screen the cDNA library at low stringency condi- 
tions. Additional EGFR clones were eliminated by re hybridization at high stringency with a short EGFR cDNA 

15 probe spanning the same region of the kinase domain. Sixty-nine clones were partially sequenced, revealing 
three that encoded a closely related yet distinct member of the Human EGF Receptor (HER)/ney subfamily of 
receptor tyrosine kinases. A survey of several human tumor cell lines identified the breast carcinoma cell line, 
MDA-MB-361, to be a more abundant source of the 6 kb transcript than A431-A3 cells, so this line was used 
to isolate overlapping cDNA clones spanning the entire coding sequence of HER3. The HER3 transcript was 

20 also detected in a primary breast carcinoma, and normal colon, kidney, and brain. 

Applicants' initial characterization of HER3 demonstrates the following conclusions: HER3 is a novel mem- 
ber of the EGFR subclass of receptor tyrosine kinases; HER3 does not interact with the ligands EGF and TGF-a; 
prolonged exposure to EGF or AR reduces EGFR phosphorylation; and a 160 kD phosphoprotein is present 
in cells expressing both HER3 and membrane-associated AR, whereas the 170 kD EGFR is completely absent. 

25 Immunoprecipitation experiments suggest this 1 60 kD phosphoprotein is HER3 and that it is present only when 
coexpressed with the integral membrane form of the AR precursor. 

Biochemical and biologic evidence suggests that members of the EGF/TGF-a/AR family are notfunctionally 
equivalent. For example, TGF-a has an activity comparable to that of EGF in many in vitro assays: TGF-a has 
an EGFR binding affinity similar to that of EGF that results in autophosphorylation of EGFR, TGF-a induces 

30 similar changes in intracellular pH and calcium levels as EGF and it transduces a proliferative signal indisting- 
uishable from that of EGF; however, TGF-a appears more potent than EGF in several in vivo assays, including 
the promotion of bone resorption, wound healing, and angiogenesis. Similarly, AR can be distinguished from 
EGF and TGF-a by its EGFR binding activity and by a number of biologic assays. 

The identification of a novel member of the EGFR/neu subclass of RTKs reveals a greater repertoire by 

35 which the EGF/TGF-a/AR family of ligands can exert their diverse growth regulatory signals. Applicants' data 
suggests that some of the differences in biological activity between AR and EGF/TGF-a may result from AR's 
ability to interact differentially with both EGFR and HER3. Since EGFR and neu have been shown to act syner- 
gistically (Kobai et al., 1989, Cell 58: 287-92), it is also conceivable that HER3 may interact with either of these 
two receptors. Moreover, the disparity in response to these ligands may be reflected in the less conserved 

40 cytoplasmic region of HER3 compared with EGFR and neu. The consequences of the amino acid changes in 
the HER3 kinase region, as well as the lack of sequence homology to EGFR in its autophosphorylation domain, 
are under investigation. The carboxy-terminal region of EGFR appears to act as a competitive inhibitor for 
exogenous substrate phosphorylation and thereby serves to fine-tume its kinase activity (Margolis et al., 1 989, 
J. Biol. Chem. 264: 10667-71; Honegger et al., 1988, EMBO 7: 3045-52). The more conserved homology be- 

45 tween the cytoplasmic regions of EGFR and neu t as opposed to HER3, suggests these two receptors share a 
more similar function, whereas the distinctiveness of HER3 suggests that it may be subject to different regu- 
lation and might signal an alternate biochemical response. 

6. EXAMPLE: cDNA CLONING OF HER3 

50 

6. 1. MATERIALS AND METHODS 

6.1.1. OLIGONUCLEOTIDE PRIMERS/PROBES AND HYBRIDIZATION CONDITIONS 

55 The oligonucleotide mixtures (including their degeneracy or length and corresponding amino acid residues) 

which were used for priming and screening the Xgt10 library were: 
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ARRDl 



5 1 -ATCATCCARCAYTTDACCATDATCA2RTA-3 • r 



(72-fold, 



YMIMVKCWMI) 



5 



ARRD2 



5 f -GCCATCCAYTTDATNGGNAC-3 f , (96-fold, VPIKWMA) 

5 * -TCGTCGACTCCTTCACCACTATCTCA—3 1 , (26-iner, EIWKD) 

5 1 -TGGCGTCGACCTATCTCAGCATCTCGGT-3 1 r <28-mer, DRDAEIV) 



S729ERR 



S720ERR 



10 



XSCT17 



5 ' -GACTCGAGTCGACATCGATTTTTTTTTTTTTTTTT-3 * , 



(35-mer) 



XSC 



5 > -GACTCGAGTCGACATCG- 3 ' , 



<17-mer) 



All oligonucleotides are designed to the anti-sense mRNA strand, except XSCT1 7 and XSC which are used 
15 as bidirectional primers. Degenerate residues are; D = A, G, or T; N = A, C, G, or T; R = A or G; Y * C or T. 
Oligonucleotides were labeled with y^P-ATP using T4 polynucleotide kinase (3x10® cpm/jag), and hybridization 
was performed at 37°C, overnight with the probe at 2 x 10 6 cpm/ml in oligo hybridization mix (6XSSC, 5X 
Denhardt's, 0.15% sodium pyrophosphate, 0.1 mg/ml salmon sperm DNA, and 0.1 mg/ml tRNA). Multiple 
washes were done in 6XSSC, 42°C. cDNA probes were prepared by random prime 32 P-labeling (specific activity 
20 of 5 - 25 x I0 8 cpm/|ig), and hybridization was performed at 42°, overnight in southern hybridization buffer (50% 
formamide, 5X SSC, 25 mM NaHP0 4 , 1x Denhardt's solution, 10% dextran sulfate and 20 ng/ml denatured 
herring sperm DNA) containing 2 x 10 6 cpm of ^p.jabeled fragment per ml. Filters were washed extensively 
in 0.5x SSC, 0.1% SDS, 65°C, and autoradiographed overnight at -70°C. 

25 6. 1 .3. cDNA LIBRARY CONSTRUCTION AND SCREENING 

Total cellular RNA was extracted from A431 (subclone A3) and MD-MBA-361 cells and poly(A) + RNA was 
isolated. First strand cDNA synthesis was performed on A431-A3 RNA with reverse transcriptase using a 72- 
fold degenerate primer mixture based on the sequence YMIMVKCWMI (ARRD1, see underlined in FIG. 3). A 

30 cDNA library was constructed in A.gt10 (B,C), and 3.0 x 10 s recombinants were screened in duplicate on nit- 
rocellulose filters. The filters were first probed at low stringency with 32 P-Iabeled ARRD2, a 96-fold degenerate 
oligonucleotide (see underlined in Fig. 2). The filters were then stripped and reprobed at high stringency with 
^P-labeled EGFR300, a cDNA fragment corresponding to amino acids 863 - 944 of the EGF receptor. Of 69 
clones that differentially hybridized to ARRD2 and EGFR300, and were therefore chosen for sequence analysis, 

35 3 were found to encode a novel EGFR-related protein (HER3), 6 encoded neu, and 1 0 spanned various regions 
of the EGF receptor. One HER3 clone was used as a probe to screen a Xgt10 cDNA library derived from 
oligo(dT)-primed MD-MBA-361 RNA, and three additional cDNA clones were isolated, one with a 4.5 kb insert 
(pHER3-3b). This clone was estimated to lack approximately 500 bp of the 5' coding region, based on its high 
degree of homology with EGFR. To obtain the 5'-cDNA clone, we used polymerase chain reaction (PCR) as 

40 follows. Single stranded cDNA, synthesized from MD-MBA-361 RNA with S729ERR as a primer, was dA-tailed, 
and used as a template for PCR amplification using S720ERR, XSCT1 7, and XSC as primers. The PCRreaction 
products were cloned into pBluescript II SK+ (Stratagene), with the longest insert containing 701 bp of 5'-HER3 
sequence (pBS5'HER3). Ail cDNA clones and several PCR generated clones were sequenced on both strands 
using T7 polymerase with oligonucleotide primers. 

45 The PCR reaction products were cloned into pBluescript II SK+ (Stratagene), with the longest insert con- 

taining 701 bp of 5'~HER3 sequence (pBS5'HER3). All cDNA clones and several PCR generated clones were 
sequenced on both strands using T7 polymerase with oligonucleotide primers. 

6.2. cDNA SEQUENCE OF HER3 



The nucleotide sequence of HER3 contains an open reading frame of 1342 amino acids beginning with a 
consensus initiating methionine at nucleotide 199 (FIG. 1). The amino adds downstream of this methionine 
have the characteristics of a signal sequence, with the mature protein predicted to begin at S 20 , followed by 
1323 amino acids with a calculated Mr of 146,000. The 4026 nucleotide coding region is flanked by 198 nuc- 
55 leotides of 5' and 755 nucleotides of 3' untranslated sequences. No poly(A) tail is present, suggesting this is 
a partial cDNA sequence. 

HER3 has all the structural features of a receptor tyrosine kinase with a single hydrophobic stretch of 32 
amino acids characteristic of a transmembrane region that divides the sequence into a 612 residue extracellular 



50 
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ligand binding domain and a 677 amino acid carboxy-terminal cytoplasmic domain, and shows strong homology 
with other members of the tyrosine kinase family (FIG. 2, FIG. 3). The ligand binding domain can be further 
divided into 4 subdomains (I - IV), including two cysteine-rich regions (II, IV) that conserve all 48 cysteines pre- 
sent in the corresponding regions of the EGFR, and are thought to provide the structural backbone for domains 

5 I and HI, which are implicated to define specificity for ligand binding. HER3 shares 40 - 50% identity with EGFR 
and 40 - 45% identity with neu in each of these su bdomains. There are 1 0 potential /V-linked glycosylation sites 
in the extracellular domain of HER3, conserving 5 of the 12 potential sites in EGFR and 3 of the 8 sites in neu. 
The cytoplasmic domain includes a stretch of basic residues flanking the membrane-spanning region, a con- 
sensus ATP binding site (G-X-G-X-X-G-X n -K) and sequences homologous to other members of the tyrosine 

10 kinase family, followed by a hydro ph il ic carboxy-terminal tail containing 13 tyrosines (FIG. 3). The kinase 
domain of HER3 is most similar to EGFR and neu (60% and 62%) and shares lower homology (26 - 34%) with 
representatives of other classes of tyrosine kinases (FIG. 2). EGFR and neu have 83% amino acid sequence 
identity between their kinase domains, suggesting that, in this domain, they are more closely related to each 
other than they are to HER3. 

15 The cytoplasmic domain of HER3 also has several unique features (FIG. 3). Protein kinase C-induced phos- 

phorylation of EGFR occurs primarily on T 654 , whereas T 669 has been implicated as a major site for phosphory- 
lation of EGFR in response to EGF. Both of these residues are conserved in neu, but both are altered in HER3 
(A 657 and D 672 ). Recent studies using EGFR mutants suggests that preventing phosphorylation at either of these 
residues has no effect on EGF binding, however, in certain cell lines, phosphorylation of EGFR T 654 by phorbol 

20 esters does appear to block EGF-induced mitogensis. HER3 already contains these "mutations" and might lack 
this negative control mechanism induced by protein kinase C. 

Two additional amino acid changes are uniquely present in the HER3 kinase domain (FIG. 3). The catalytic 
domains of all known protein kinases contain certain residues that are either highly or completely conserved. 
HER3 contains most of these amino acids including; G 697 , G 699 , V 704 , K 723 , N 820 , DFG 833 - 835 , E 882 , D 874 , G 879 

25 and R 938 however HER3 has no n conservative substitutions at C 721 , H 740 , and N 815 . The latter two amino acids 
are present as glutamate and aspartate, respectively, in all known kinases; serine/threonine as well as tyrosine 
kinases (FIG. 2, FIG. 3). Strict conservation of these residues suggests they are important for catalytic activity, 
and the amino acid changes at these positions in HER3 predict it may have altered or absent kinase activity. 
The carboxy-terminal 353 amino acids of HER3 contains 13 tyrosines, several of which are flanked by 

30 numerous charged residues. These features are characteristic of the autophosphorylation domains of EGFR 
and neu. This region of HER3 shares no significant primary sequence homology to either of these two family 
members and is approximately 30 - 50% longer. The sequence YEYM is repeated three times in this domain 
of HER3, possibly the result of gene duplication. 

35 7. EXAMPLE: TRANSIENT EXPRESSION OF HER3 IN COS CELLS 

7.1. MATERIALS AND METHODS 

7.1.1. CONSRUCTION OF EXPRESSION VECTORS AND TRANSFECTIONS 

40 . ^ ~ 

Plasmid cHER3x was constructed for COS expression as follows. PCR clone pBSS'HERS was partially 
digested with Hindi and Smal and religated to eliminate 157 bp of HER3 5 f UTR. The 528 bp Kpnl-Sstl fragment 
was shuttled into pSPT18 (Boehringer) placing Xbal and Sail sites upstream of the coding sequence. This con- 
struct (pSP5'HER3c) contains 41 bp of HER3 5'UTR and the sequence encoding the first 174 amino acids of 

45 the HER3 precursor (to nucleotide 721). A 3' HER3 cDNA clone (pHER3-3b), was digested with EcoRi and 
Kpnl, and the 3.7 kb fragment subcloned into pSPT18. The resultant construct (p3'HER3EK3.7) contains HER3 
sequence from nucleotide 646 to the Kpnl site (nucleotide 4287) 63 bp 3' of the stop codon. A 3.7 kb partial 
Hindlli-Xbal (Hindlll at nucleotide 662 in HER3 sequence) digested fragment was isolated from p3'HER3EK3.7 
and ligated into pBiuescript II SK+ (Stratagene) along with the Sall-Hindlll fragment of pSP5'HER3c, generating 

50 pBSHER3 containing the complete HER3 coding sequence flanked by Xbal sites. The 4.1 kb Xbal fragment 
was inserted into pCDM8 (Invitrogen) placing the HER3 cDNA under the control of the cytomegalovirus pro- 
moter and enhancer (cHER3x). The expression construct pH3ARP (Plowman et aL, 1989, submitted), contains 
the complete coding sequence for the amphiregulin precursor. The expression plasmids were grown in com- 
petent MC1061/P3 cells, and introduced into COS-1 cells using the DEAE-dextran method. Cell pellets were 

55 harvested 72-96 hr after transfection, and analyzed by immunobiotting as described (Plowman et a!., 1 989, sub- 
mitted), except that samples were separated on 7% polyacrylamide gels. Polyclonal antisera was generated 
against synthetic peptides specific to the kinase domain of HER3 (PPDDKQLLYSEAKT, amino acids 841-854; 
GAEPYAGLRLAEVPD, amino acids 889-903). 



10 



EP 0 444 961 A1 

7.1.2. IMMUNE COMPLEX KINASE ASSAYS 

For immune complex kinase assays, 10 cm dishes of transfected COS cells (72 hours post-transfection) 
were washed three times in PBS, and scraped into 1 ml solubilization buffer (20 mM HEPES pH7.35 f 150 mM 

5 NaCI, 10% glycerol, 1% Triton X-1 00, 1.5mM MgCl 2 , 1 mM EGTA, 2 mM phenylmethylsulfonyl fluoride, 1 ug/ml 
aprotinin). After 10 min incubation on ice, cell debris was removed by 12,000 x g centrifugation, 5 min, 4°C. 
The supernatant was then precleared with 50 p\ of a 1:1 slurry of Staphylococcus aureus protein A-Sepharose 
suspended in solubilization buffer, and incubated with 5 \x\ EGFR1 monoclonal antibody (Amersham) and 50 
u.| S. aureus protein A-Sepharose, overnight, 4°G. Immune complexes were washed three times in HNTG (20 

10 mM HEPES, pH 7.35, 150 mM NaCI, 0.1% Triton X-1 00, 10% glycerol), and stored at -7G°C. For autophos- 
phorylation, thawed immunoprecipitates were resuspended in 20 u.l HNTG with 15 mM MnCI 2 . Five u.l 100 uM 
ATP with 15 pCi y^P-ATP was added and incubated for 1 min on ice. The reaction was terminated with 25 p.1 
hot SDS sample buffer, and samples were analyzed on 7% SDS-polyacrylamide gels. 

15 7.2. BIOCHEMICAL PROPERTIES OF RECOMBINANT HER3 

To determine its biochemical properties, we inserted the complete coding sequence of HER3 into an exp- 
ression vector under the control of the cytomegalovirus immediate-early promoter. This vector (cHER3x) was 
transfected into COS cells and the transiently expressed protein was detected by immunoblot analysis using 

20 antisera specific to the cytoplasmic domain of HER3. The recombinant protein migrated with an apparent Mr 
of 160,000, slightly less than the EGFR (FIG. 4). The HER3 expressing cells were tested for specific binding 
of 126 I-EGF or 125 l-TGF-a by cross-linking studies on whole cells. COS cells have high background levels of 
EGFR, however no differences were seen on binding to normal or HER3 expressing cells. The abundance of 
EGFR in COS cells, and difficulties obtaining high specific activity * 25 I-AR, complicated efforts to assess 

25 whether AR is a ligand for HER3. Instead a functional assay was used to demonstrate the specific effects of 
AR on this putative receptor. 

Applicants have recently cloned the AR gene, and found the 78 amino acid secreted glycoprotein to be 
derived from a 252 amino acid transmembrane precursor by proteolytic cleavage. Further, the AR precursor 
is expressed in its membrane bound form in several human breast carcinoma cell lines and in COS cells trans- 

30 fected with an expression vector (pH3ARP) containing the complete AR coding sequence. This expression con- 
struct was used as a source of membrane bound AR to study its effect on the phosphorylation of EGFR and 
HER3, COS cells were transfected with either the HER3 or AR expression construct, or cotransfected with both. 
Two days after transfection, cells were put on serum free media for an additional 24 hr. The cells were solubi- 
lized and immunoprecipitated with a monoclonal antibody (Mab) to EGFR. Immune complexes were incubated 

35 with y^P-ATP and Mn ++ , and resolved by 7% SDS-PAGE. COS cells, transfected with vector control or with 
HER3, showed strong phosphorylation of a single 170 kD protein (FIG. 4B, lanes 1-5), consistent with a high 
basal level of EGFR phosphorylation in COS cells. The amount of this phosphorylated protein was reduced 
significantly following prolonged treatment with EGF or AR, consistent with receptor internalization and degra- 
dation (Fig. 4B, lanes 2, 3, 6, 7). Unexpectedly, cells transfected with the AR precursor construct showed a 

40 complete loss of phosphorylated EGFR (Fig. 4B, lane 4) and absence of an ^S-labeled 170 kD EGFR. These 
cells exhibited an altered structural morphology, with extensive clumping into aggregates of interconnected 
islands, compared with the intact monolayer seen on transfection with the other constructs. Following prolonged 
exposure to EGF or AR, normal COS cells also displayed a disrupted monolayer, but with less aggregation 
than seen in cells expressing the AR precursor. Cotransfection of the HER3 and AR constructs resulted in a 

45 morphology similar to that seen on transfection with AR alone. Furthermore, the EGFR Mab was able to 
immunoprecipitate a new phosphorylated protein of MR 160,000 that comigrates with the recombinant HER3, 
and was only detected in COS cells expressing both AR and HER3 (Fig. 4B, lane 8). 

These experiments suggested that EGFR shares an antigenic determinant with HER3, or that HER3 is 
coimmunoprecipitated with the EGFR Mab due to an association between these two RTK homologs. Alterna- 
te? tively, the 160 kD phosphoprotein could be a smaller form of EGFR, or a tightly associated substrate, that is 
specifically phosphorylated in the presence of both HER3 and the AR precursor, as seen with EGFR and phos- 
pholipase C-ll (Margolis et al„ 1989, Ceil 57: 1 101-07), To explore these possbilities further, in vivo phosphory- 
lation experiments to label transfected COS cells and immunoprecipitated HER3 were performed with peptide 
antisera directed against its kinase domain. A 160 kD phosphoprotein was readily identified in COS cells trans- 

55 fected with HER3 and AR, but not in cells transfected with vector alone (FIG. 4C). 
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Claims 

1. A nucleotide sequence encoding a HER3 precursor comprising the nucleotide coding sequence substan- 
tially as depicted in FIG. 1 from about nucleotide number 1 to about nucleotide number 4978. 

2. A nucleotide sequence encoding a HER3 precursor comprising the nucleotide coding sequence substan- 
tially as depicted in FIG. 1 from about nucleotide number 199 to about nucleotide number 4224. 

3. A nucleotide sequence encoding a mature HER3 comprising the nucleotide coding sequence substantially 
as depicted in FIG. 1 from about nucleotide number 256 to about nucleotide number 4224. 

4. A HER3 precursor comprising the amino acid sequence substantially as depicted in FIG. 1 from about 
amino acid residue number +1 to about amino acid residue number 1323. 

5. A mature HER3 comprising the amino acid sequence substantially as depicted in FIG. 1 from about amino 
acid residue number +1 to about amino acid residue number 1323. 

6. A method for producing HER3 comprising: 

(a) culturing a eucaryotic cell containing a nucleotide sequence encoding HER3 under the control of a 
second nucleotide that regulates gene expression so that a protein having HER3 biological activity is 
produced by the eucaryotic cell; and 

(b) recovering the HER3 from the culture. 

7. The method according to claim 6 in which the nucleotide sequence encoding HER3 comprises the nuc- 
leotide sequence substantially as depicted in FIG. 1 from nucleotide number 1 to nucleotide number 4978. 

8. The method according to claim 6 in which the nucleotide sequence encoding HER3 comprises the nuc- 
leotide sequence substantially as depicted in FIG. 1 from nucleotide number 199 to nucleotide number 
4224. 

9. The method according to claim 6 in which the nucleotide sequence encoding HER3 comprises the nuc- 
leotide sequence substantially as depicted in FIG. 1 from nucleotide number 256 to nucleotide number 
4224. 
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Fig. 1 
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Fig. 1 (con'd) 
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1160 * *1170 1100 1130 

STLSSVGLSSVLGTEEEDEDEEYEtHIIRRRRKSRPKPPRP 
37*1 GGC6CXCTtTCTTC46TS6STCTCA0TTCT(5TCCTGWTACTSAAaUS4A<^TWW 

1240 12 SO 12*0 1270 

T # 0 E 0 Y E T n U R 0 R 0 S ft 0 9 C C 0 T A A M G A C t A S E 0 * ? E E * » * 

J951 ACTCCWTSAA«ACTA?WTATAT$AATCG«M£CAG*T^AC^^ 

1790 1290 1300 1310 

PQGPGHQAPHVH^AiLKTLftSLEA TO $AFOKPD*««SRLF 

fiOCl T7TCAQry^CTGGACATCAGGCCCCCCA?GTCXA^ 
1320 

*2fll raAAGGC7UTGttCAGAWoT^ 
till CCAGCCCCTTTTCCCCASTCCCASACi^^ 
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4C03 T7TiTCATC:TTAAAACUTTCTGHA^TACATATTATCTCATTTTACACAAAG«^TC9GGa 
«32 1 ACC7SA«CAAGGAGTTT6AGACCASCnAGCCAACATAGTAAGACCCCCATCTcTTt 
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